Lexicon expansion using pronunciation variations extracted on the basis of speaker-related deviation in recognition error statistics

نویسنده

Yoshifumi Onishi

چکیده

We propose a novel method for lexicon expansion using pronunciation variations extracted on the basis of speaker-related deviations in ASR error statistics. Two types of pronunciation variations were extracted: common pronunciation variations found with most speakers, and speaker-related pronunciation variations, identified on the basis of recognition error elements weighted by idf and tf-idf measures. Experimental results for CSJ show that entries added to the lexicon from speaker-related pronunciation variations were more effective than those generated on the basis of common pronunciation variations, some of which were superfluous.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiple-Pronunciation Lexical Modeling Based on Phoneme Confusion Matrix for Dysarthric Speech Recognition

In this paper, we propose speaker-dependent multiple-pronunciation lexical modeling for improving the performance of dysarthric automatic speech recognition (ASR). For each dysarthric speaker, a phoneme confusion matrix is first constructed from the results of phoneme recognition. Then, pronunciation variation rules are extracted by investigating the phoneme confusion matrix, and they are incor...

متن کامل

Pronunciation lexicon adaptation for TTS voice building

This paper describes reducing phone label errors in TTS voice building by means of modeling of speaker pronunciation variants. Each speaker has his or her own unique pronunciations (and context-dependent variations), so that no one standard lexicon is able to cover all of the speaker’s variations. Creating speaker-dependent pronunciation lexicons for automatic speech labeling of our TTS voice d...

متن کامل

Modelling pronunciation variations in spontaneous Mandarin speech

Pronunciation in spontaneous Mandarin speech tends to be much more variable than in read speech. In current recognition systems, pronunciation dictionaries usually only contain one standard pronunciation for each word, so that the amount of variability that can be modelled is very limited. Most recent research work for modelling variations in spontaneous speech focuses on the lexicon level, whi...

متن کامل

Pronunciation Adaptation For Disordered Speech Recognition Using State-Specific Vectors of Phone-Cluster Adaptive Training

Pronunciation variation is a major problem in disordered speech recognition. This paper focus on handling the pronunciation variations in dysarthric speech by forming speaker-specific lexicons. A novel approach is proposed for identifying mispronunciations made by each dysarthric speaker, using state-specific vector (SSV) of phone-cluster adaptive training (Phone-CAT) acoustic model. SSV is low...

متن کامل

Data-driven Pronunciation Modelling for Non-native Speakers Using Association Strength between Phones

In this paper we present an approach to modelling pronunciation variation, particularly for non-native speakers, by modifying the lexicon. In this way we can model several speakers simultaneously, i.e. use the same lexicon and the same acoustic models for all speakers. We use a data-driven approach, i.e. methods based solely on the reference lexicon, the recognizer’s acoustic models, and the ac...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2008

Lexicon expansion using pronunciation variations extracted on the basis of speaker-related deviation in recognition error statistics

نویسنده

چکیده

منابع مشابه

Multiple-Pronunciation Lexical Modeling Based on Phoneme Confusion Matrix for Dysarthric Speech Recognition

Pronunciation lexicon adaptation for TTS voice building

Modelling pronunciation variations in spontaneous Mandarin speech

Pronunciation Adaptation For Disordered Speech Recognition Using State-Specific Vectors of Phone-Cluster Adaptive Training

Data-driven Pronunciation Modelling for Non-native Speakers Using Association Strength between Phones

عنوان ژورنال:

اشتراک گذاری